Extracting topics in texts: Towards a fuzzy logic approach

Authors

  • Mohand Boughanem
  • Henri Prade
  • Ourdia Bouidghaghen
Abstract

The paper presents a preliminary investigation of potential methods for extracting semantic views of text contents that go beyond standard statistical indexing. The aim is to build fuzzily weighted, structured images of semantic contents. A preliminary step consists in identifying the different types of relations (is-a, part-of, related-to, synonymy, domain, glossary relations) that exist between the words of a text, using a general ontology such as WordNet. Taking advantage of these relations, different types of fuzzy clusters of words can then be built. Moreover, apart from its frequency of occurrence, the importance of a word may also be evaluated through an estimate of its specificity. The size of the clusters and the frequency and specificity of their words are indications that enable us to build a fuzzy set of sets of words that progressively "emerge" from a text as being representative of its contents. The ideas advocated in the paper and their potential usefulness are illustrated on a running example. It is expected that a better representation of the semantic contents of texts may help to better retrieve the texts that are relevant to a given query, and to give a potential reader some indication of what the text is about.
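The pipeline outlined in the abstract (relations between words, clusters built from those relations, then weights drawn from cluster size, frequency and specificity) can be illustrated with a minimal Python sketch. It is not the authors' method: the relation table `RELATIONS` and the scores in `SPECIFICITY` are hypothetical, hand-written stand-ins for what a WordNet lookup might provide, and the scoring rule (share of tokens covered times mean specificity, normalised so the best cluster has membership 1) is an assumption chosen only to make the idea concrete.

```python
from collections import Counter

# Hypothetical word pairs assumed to be linked by some relation
# (is-a, part-of, related-to, ...); a stand-in for ontology lookups.
RELATIONS = [
    ("car", "vehicle"),
    ("wheel", "car"),
    ("truck", "vehicle"),
    ("bank", "money"),
]

# Hypothetical specificity estimates in [0, 1]; higher means more specific.
SPECIFICITY = {
    "car": 0.6, "wheel": 0.8, "truck": 0.7,
    "vehicle": 0.3, "bank": 0.5, "money": 0.4,
}


def build_clusters(words, relations):
    """Group words into connected components of the relation graph."""
    parent = {w: w for w in words}

    def find(w):
        # Follow parent links to the representative, compressing the path.
        while parent[w] != w:
            parent[w] = parent[parent[w]]
            w = parent[w]
        return w

    for a, b in relations:
        if a in parent and b in parent:
            parent[find(a)] = find(b)

    clusters = {}
    for w in words:
        clusters.setdefault(find(w), set()).add(w)
    return list(clusters.values())


def fuzzy_topics(tokens, relations, specificity):
    """Return (cluster, membership) pairs, highest membership first.

    Assumed score: share of tokens covered by the cluster multiplied by
    the mean specificity of its words, divided by the best score.
    """
    freq = Counter(tokens)
    total = sum(freq.values())
    scored = []
    for cluster in build_clusters(set(freq), relations):
        coverage = sum(freq[w] for w in cluster) / total
        mean_spec = sum(specificity.get(w, 0.5) for w in cluster) / len(cluster)
        scored.append((frozenset(cluster), coverage * mean_spec))
    if not scored:
        return []
    top = max(score for _, score in scored) or 1.0
    return sorted(((c, s / top) for c, s in scored), key=lambda item: -item[1])
```

Calling `fuzzy_topics(text.split(), RELATIONS, SPECIFICITY)` on a short text yields a ranked list of word sets with membership degrees, which is one simple reading of "a fuzzy set of sets of words"; words with no listed relation end up as singleton clusters.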


Similar resources

Impact of Structural Components of Market on the Markup Level Based on Radial Basis Neural Network and Fuzzy Logic

This paper aims to evaluate the impact of several indices of market structure, including barriers to entry, economies of scale and degree of concentration, on 140 active industries using the digit. Accordingly, we apply three methods, including the cost disadvantage ratio, the Herfindahl–Hirschman concentration index and the Comanor and Wilson criterion, in order to assess the economies of scale and usin...


FUZZY LOGISTIC REGRESSION BASED ON LEAST SQUARE APPROACH AND TRAPEZOIDAL MEMBERSHIP FUNCTION

Logistic regression is a non-linear modification of the linear regression. The purpose of the logistic regression analysis is to measure the effects of multiple explanatory variables which can be continuous and response variable is categorical. In real life there are situations which we deal with information that is vague in nature and there are cases that are not explained precisely. In this regard,...


Applications of Fuzzy Program Graph in Symbolic Checking of Fuzzy Flip-Flops

All practical digital circuits are usually a mixture of combinational and sequential logic. Flip–flops are essential to sequential logic; therefore, fuzzy flip–flops are considered to be among the most essential topics of fuzzy digital circuits. The concept of the fuzzy digital circuit is among the most interesting applications of fuzzy sets and logic due to the fact that if there has to be an ultimat...


Controlling Electrochemical Machining By Using a Fuzzy Logic Approach

New trends and the effect of key factors influence the quality of the holes produced by ECM processes. Researchers developed a fuzzy logic controller by adding intelligence to the ECM process. Maintaining optimum ECM process conditions ensures higher machining efficiency and performance. This paper presents the development of a fuzzy logic controller to add intelligence to the ECM process. An e...




Publication date: 2008